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Multiple Loop Self-Triggered Model Predictive 
Control for Network Scheduling and Control 

Erik Henriksson, Daniel E. Quevedo, Edwin G.W. Peters, Henrik Sandberg, Karl Henrik Johansson 


Abstract —We present an algorithm for controlling and 
scheduling multiple linear time-invariant processes on a shared 
bandwidth limited communication network using adaptive sam¬ 
pling intervals. The controller is centralized and computes at 
every sampling instant not only the new control command for a 
process, but also decides the time Interval to wait until taking 
the next sample. The approach relies on model predictive control 
ideas, where the cost function penalizes the state and control 
effort as well as the time Interval until the next sample is 
taken. The latter is introduced in order to generate an adaptive 
sampling scheme for the overall system such that the sampling 
time increases as the norm of the system state goes to zero. 
The paper presents a method for synthesizing such a predictive 
controller and gives explicit sufficient conditions for when it is 
stabilizing. Further explicit conditions are given which guarantee 
conflict free transmissions on the network. It is shown that 
the optimization problem may be solved off-line and that the 
controller can be implemented as a lookup table of state feedback 
gains. Simulation studies which compare the proposed algorithm 
to periodic sampling illustrate potential performance gains. 

Index Terms —Predictive Control; Networked Control Systems; 
Process Control; Stability; Scheduling; Self-triggered Control. 

I. Introduction 

Wireless sensing and control systems have received in¬ 
creased attention in the process industry over the last years. 
Emerging technologies in low-power wake-up radio enables 
engineering of a new type of industrial automation systems 
where sensors, controllers and actuators communicate over a 
wireless channel. The introduction of a wireless medium in 
the control loop gives rise to new challenges which need to 
be handled IH. The aim of this paper is to address the problem 
of how the medium access to the wireless channel could 
be divided between the loops, taking the process dynamics 
into consideration. We investigate possibilities to design a 
self-triggered controller, that adaptively chooses the sampling 
period for multiple control loops. The aim is to reduce the 
amount of generated network traffic, while maintaining a 
guaranteed level of performance in respect of driving the initial 
system states to zero and the control effort needed. 

Consider the networked control system in Fig. [T] which 
shows how the sensors and the controller are connected 
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Fig. 1. Actuators A and processes V are wired to the controller C while the 
sensors S communicate over a wireless network, which in turn is coordinated 
by the Network Manager. 

through a wireless network. The wireless network is controlled 
by a Network Manager which allocates medium access to 
the sensors and triggers their transmissions. This setup is 
motivated by current industry standards based on the IEEE 
802 . 15.4 standard, e.g., Q, 13, which utilizes this structure 
for wireless control in process industry. Here the triggering is 
in turn generated by the controller which, in addition to com¬ 
puting the appropriate control action, dynamically determines 
the time of the next sample by a self-triggering approach Q. 
In doing so the controller gives varying attention to the loops 
depending on their state, while trying to communicate only 
few samples. To achieve this the controller must, for every 
loop, trade control performance against inter sampling time 
and give a quantitative measure of the resulting performance. 

The main contribution of the paper is to show that a self¬ 
triggering controller can be derived using a receding horizon 
control formulation where the predicted cost is used to jointly 
determine what control signal to be applied as well as the time 
of the next sampling instant. Using this formulation we can 
guarantee a minimum and a maximum time between samples. 

We will initially consider a single-loop system. We will then 
extend the approach to the multiple-loop case which can be 
analyzed with additional constraints on the communication 
pattern. The results presented herein are extensions to the 
authors’ previous work presented in 13. These results have 
been extended to handle multiple control loops on the same 
network while maintaining control performance and simulta¬ 
neously guarantee conflict free transmissions on the network. 

The development of control strategies for wireless automa¬ 
tion has become a large area of research in which, up until 
recently, most efforts have been made under the assumption 
of periodic communication 0. However the idea of adaptive 
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sampling is receiving increased attention. The efforts within 
this area may coarsely be divided into the two paradigms of 
event- and self-triggered control. In event-triggered control, 

e.g., Q, ii, ii, Eoi, mi], Ha, uni, ina, ina the sen¬ 
sor continuously monitors the process state and generates a 
sample when the state violates some predefined condition. 
Self-triggered control, e.g., JH, IfT^ , IflTl , IfTSlI , utilizes a 
model of the system to predict when a new sample needs 
to be taken in order to fulfill some pre-defined condition. A 
possible advantage of event- over self-triggered control is that 
the continuous monitoring of the state guarantees that a sample 
will be drawn as soon as the design condition is violated, thus 
resulting in an appropriate control action. The self-triggered 
controller will instead operate in open-loop between samples. 
This could potentially be a problem as disturbances to the 
process between samples cannot be attenuated. This problem 
may however be avoided by good choices of the inter-sampling 
times. The possible advantage of self- over event-triggered 
control is that the transmission time of sensor packets is known 
a-priori and hence we may schedule them, enabling sensors 
and transmitters to be put to sleep in-between samples and 
thereby save energy. 

The research area of joint design of control and communica¬ 
tion is currently very active, especially in the context of event- 
triggered control. In IT^ a joint optimization of control and 
communication is solved using dynamic programming, placing 
a communication scheduler in the sensor. In ||20|, ll2Tll and ll22] 
the control law and event-condition are co-designed to match 
performance of periodic control using a lower communication 
rate. In ||2^ this idea is extended to decentralized systems. 
The use of predictive control is also gaining popularity within 
the networked control community ll24ll . Il25] . 1261, lIZTll . In 
1281 predictive methods and vector quantization are used to 
reduce the controller to actuator communication in multiple 
input systems. In l2^ model predictive control (MFC) is used 
to design multiple actuator link scheduling and control signals. 
There have also been developments in using MFC under 
event-based sampling. In IMl a method for trading control 
performance and transmission rate in systems with multiple 
sensors is given. In ED an event-based MFC is proposed 
where the decision to re-calculate the control law is based 
on the difference between predicted and measured states. 

The problem addressed in the present paper, namely the 
joint design of a self-triggering rule and the appropriate control 
signal using MFC has been less studied than its event-triggered 
counterpart. In ll^ an approach relying on an exhaustive 
search which utilizes sub-optimal solutions giving the control 
policy and a corresponding self-triggering policy is presented. 
In lf3^ it is suggested that a portion of the open loop trajectory 
produced by the MFC should be applied to the process. The 
time between re-optimizations is then decided via a self¬ 
triggering approach. 

The approach taken in this paper differs from the above 
two in that the open loop cost we propose the MFC to solve 
is designed to be used in an adaptive sampling context. Further 
our extension to handle multiple loops using a self-triggered 
MFC is new. So is the guarantee of conflict free transmissions. 

The outline of the paper is as follows. In Section |II] 


the self-triggered network scheduling and control problem is 
defined and formulated as a receding horizon control problem. 
SectionUnipresents the open-loop optimal control problem for 
a single loop, to be solved by the receding horizon controller. 
The optimal solution is presented in Section |IV] Section El 
presents the receding horizon control algorithm for a single 
loop in further detail and gives conditions for when it is stabi¬ 
lizing. The results are then extended to the multiple loop case 
in Section [yn] where conditions for stability and conflict free 
transmissions are given. The proposed method is explained and 
evaluated on simulated examples in Section IVIIII Concluding 
discussions are made in Section HXl 

II. Self-Triggered Networked Control 
Architecture 

We consider the problem of controlling s > 1 processes 
Vi through Vs over a shared communication network as in 
Fig. [D The processes are controlled by the controller C which 
computes the appropriate control action and schedule for each 
process. Each process Vi is given by a linear time-invariant 
(LTI) system 

xt{k+ 1) = Aixt{k) + Biui{k), 

xi{k) &W^^,ue{k) 

The controller works in the following way: At time k = k^, 
sensor transmits a sample Xi{ki) to the controller, which 
then computes the control signal Ui{ki) and sends it to the 
actuator Ae. Here £ G {1,2,..., s} is the process index. The 
actuator in turn will apply this control signal to the process 
until a new value is received from the controller. Jointly with 
deciding ui{ki) the controller also decides how many discrete 
time steps, say Ii{ki) G N’*' = {1, 2,... }, it will wait before 
it needs to change the control signal the next time. This value 
Ie{ki) is sent to the Network Manager which will schedule the 
sensor Se, to send a new sample at time k = kg + Ig{kg). To 
guarantee conflict free transmissions on the network, only one 
sensor is allowed to transmit at every time instance. Hence, 
when deciding the time to wait Ie{ke), the controller must 
make sure that no other sensor Sq, q ^ £, already is scheduled 
for transmission at time k = kg + Ig{kg). 

We propose that the controller C should be implemented as 
a receding horizon controller which for an individual loop £ 
at every sampling instant k = kg solves an open-loop optimal 
control problem. It does so by minimizing the infinite-horizon 
quadratic cost function 

f \\xg{kg + 1 )\\q^ + \\ug{kg + 

subject to the user defined weights Qg and Rg, while taking 
system dynamics into account. In SectionHIIlwe will embellish 
this cost function, such that control performance, inter sam¬ 
pling time and overall network schedulability also are taken 
into consideration. 

Remark 1. The focus is on a networked system where the 
network manager and controller are integrated in the same 
unit. This means, that the controller can send the schedules, 
that contain the transmission times of the sensors, directly to 
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Fig. 2. The prediction horizon. Here typical signal predictions are shown 
with I{k) = 3 and p = 5. 


the network manager. This information needs to be transmitted 
to the sensor node such that it knows when to sample and 
transmit. For example, using the IEEE 802.15.4 superframe, 
described in this can be done without creating additional 
overhead. Here the network manager broadcasts a beacon in 
the beginning of each superframe, which occurs at a fixed time 
interval. This beacon is received by all devices on the network 
and includes a schedule of the sensors that are allowed to 
transmit at given times. 

III. Adaptive Sampling Strategy 

For pedagogical ease we will in this section study the case 
when we control a single process on the network allowing us 
to drop the loop index £. The process we control has dynamics 

x{k + 1) = Ax{k) + Bu{k), x{k) e M”, u{k) G R"* (2) 

and the open-loop cost function we propose the controller to 
minimize at every sampling instant is 

OO 

J{x{k),i,U) = - + X! 

where a G R"*" is a design variable that is used to trade 
off the cost of sampling against the cost of control and 
i = I{k) G {l,2,...,p} is the number of discrete time 
units to wait before taking the next sample, where p G N’*' 
is the maximum number of time units to wait until taking 
the next sample. Further, the design variables 0 < Q 
0 < i? are symmetric matrices of appropriate dimensions. We 
optimize this cost over the constraint that the control sequence 
U — {u{k),u{k + 1),...} should follow the specific shape 
illustrated in Fig. |2] for some fixed period p. The period p 
is the maximum amount of time steps the sensor is allowed 
to wait before taking the next sample. If p = 1, a sample is 
taken at every time instance and a regular receding horizon 
cost is obtained. If one on the other hand would select a 
large value of p, very long sampling intervals can be obtained. 
This can though affect the control performance significantly 
in case there are disturbances in the system that make the size 
of the state increase. Thus, the number of discrete time units 
i = I{k) G {1, 2,... ,p} to wait before taking the next sample 
x{k + i), as well as the levels in the control sequence U are 
free variables over which we optimize. Note that neither the 
state nor the control values have a constrained magnitude. 



The constraint on the shape of the control trajectory U is 
motivated by the idea, that this sequence is applied to the 
actuator from time k until time instant k+i. At time x{k+i) we 
take a new sample and redo the optimization. By this method 
we get a joint optimization of the control signal to be applied 
as well as the number of time steps to the next sampling 
instant. The reason for letting the system be controlled by a 
control signal with period p after this is that we hope for the 
receding horizon algorithm to converge to this sampling-rate. 
We will provide methods for choosing p so that this happens 
in Section |V] The reason for wanting convergence to a down 
sampled control is that we want the system to be sampled at 
a slow rate when it has reached steady state, while we want 
it to be sampled faster during the transients. 

If one includes the above mentioned constraints, then Q 
can be rewritten as 


J{x{k),i,U{i)) ^ f \\x{k -f OIIq + lk(fc)llfl ) 

* i=o ^ ^ 

OO / P-1 / 
r —0 \0 ^ 

where i and U{i) = {u{k), u{k -f i), u{k + i + p ■ p),...}, 
rj G N+, are the decision variables over which we optimize. 
The term a/i reflects the cost of sampling. We use this cost 
to weight the cost of sampling against the classical quadratic 
control performance cost. For a given x{k), choosing a large a 
will force i to be larger and hence give longer inter sampling 
times. By the construction of the cost, we may tune Q, R 
and a via simulations to get the desired sampling behaviour 
while maintaining the desired control performance. One could 
imagine a more general cost of sampling. Here however we 
found a/i sufficient to be able to trade control performance 
for communication cost, see simulations in Section IVIII-BI 


IV. Cost Function Minimization 


Having defined the open-loop cost @ we proceed by 
computing its optimal value. We start by noticing that, even 
though we have a joint optimization problem, we may state it 
as 


minimize (minimize J{x{k),i,ll{i))]. (5) 

i V Uii) / 


We will use this separation and start by solving the inner 
problem, that of minimizing J{x{k),i,l4{i)) for a given value 
of t. In order to derive the solution, and for future reference, 
we need to define some variables. 


Definition 1. We define notation for the lifted model as 


i— 1 

A^^=A\ 

and notation for the generalized weighting matrices associated 
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to (O as 

g(i) ^ 

i?w = + i? 

TVW = Ar(*- 1 ) + 

w/zere z G {1,2,... ,p}, = Q, = i? ant/ = 0. 

Using Definition [T] it is straightforward to show the follow¬ 
ing lemma. 

Lemma 1. It holds that 

^ (Wxik + 1)\\q + \\u{k)\\l] 

1^0 ^ ^ 

= x{kY' Q^^\{k) + u{k)'^ u{k) + 2x{k)'^ u{k) 
and x{k + i) = A^'^'>x{k) + B^'^^u{k). 

Lemma 2. Assume that 0 < g, 0 R and that the pair 
(A^P) ^ B controllable. Then 


where also L^pI is given by Lemma \2] 

Proof. The proof is given in Appendix |B] □ 

Now, getting back to the original problem (|5]l. Provided that 
the assumptions of Theorem [T] hold, we may apply it giving 
that 

min J{x{k),i,U{i)) =m\n (—+ |la:(fc)||L.) ]•. 
i,U{i) ill ^ ) 

Unfortunately the authors are not aware of any method to solve 
this problem in general. If however i is restricted to a known 
finite set 1° C N+ we may find the optimal value within 
this set for a given value of x{k) by simply evaluating “ -|- 
Mk)\\U Vi G and by this obtaining the i which gives 
the lowest value of the cost. This procedure gives the optimum 
of®. Note that the computational complexity of finding the 
optimum is not necessarily high, as we may compute 
Vi G off-line prior to execution. 

V. Single Loop Self-Triggered MPC 


oo /p -1 ^ 

Z [Mk + i + r-p + l)\\% 

-h ||u(fc-hi-hr = ||x(fc-hi)||p(p) 

and the minimizing control signal characterizing lA{i)) is given 
by 

u{k -\- i r ■ p) = —L^P'^x{k + i + r ■ p). 

In the above. 


Having presented the receding horizon cost, we now con¬ 
tinue with formulating its implementation in further detail. We 
will assume that 0 < g, 0 < P and that the down-sampled 
pair {A^P\ B^P'>) is controllable. Let us also assume that the 
finite and non-empty set 1° C N+ is given and let 

7 = maxl°. (9) 

From this we may use Theorem [1] to compute the pairs 
Vi G 1° and formulate our proposed receding 
horizon control algorithm for a single loop; 


p(p) = g(p) -p yl(p)'^p(p)y4(p) 

— + N^p'>)L^p'^ 

pip) = + pip)'^ pip) B(p)y^ 

X {A<^P^^ P^P^B^P^ + N^pY ■ 

Proof. The proof is given in Appendix lAl □ 

Using the above results we may formulate the main result 
of this section as follows. 

Theorem 1 (Closed Form Solution). Assume that 0 < Q, 
0 < R and that the pair {A^p\ B^p')) is controllable. Then 

minJ{x{k),i,l{{i)) = - -f ||a;(fc)||^„ (7) 

U{i) i 

where 

pii) = Qi^) p J^ii)^pip) j^ii) 

— P^P'> -I- 

p« = (pw -ppw'^p(p)p(* 

and P^P'i is given by Lemma |2] Denoting the vector of all ones 
in K” as 1 „, the minimizing control signal sequence is given 
by 

U* = {—L^*^x(fc)lf, —L^P^x{k -\-i-\-r ■ p)lj,...}, r G N 


r' 


Algorithm 1. Single Loop Self-Triggered MPC 

1) Af time k = k' the sample x(k') is transmitted by the 
sensor to the controller. 

2) Using x(k') the controller computes 

I{k') = argmm | -h ||a;(fc')llpw 
u{k') = 

3) a) The controller sends u{k') to the actuator which 

applies u(k) = u{k') to ® until k = k' + I{k'). 
b) The network manager is requested to schedule the 
sensor to transmit a new sample at time k = k' -\- 

I{k'). 

Remark 2. Note that the control signal value is sent to 
the actuator at the same time as the controller requests the 
scheduling of the next sample by the sensor. 

Remark 3. The proposed algorithm guarantees a minimum 
and a maximum inter-sampling time. The minimum time is 1 
time step in the time scale of the underlying process m and the 
maximum inter sampling time is 7 time steps. This implies that 
there is some minimum attention to every loop independent of 
the predicted evolution of the process. 

Remark 4. Even though we are working in uniformly sampled 
discrete time the state is not sampled at every time instant k. 
Instead the set of samples of the state actually taken is given 
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by the set T), which assuming that the first sample is taken at 
k = 0, is given by 

V = {x{0),xiI{Q)),xiI{I{Q))),...}. (10) 


VI. Analysis 

Having established and detailed our receding horizon con¬ 
trol law for a single loop we continue with giving conditions 
for when it is stabilizing. Letting A (A) denote the set of 
eigenvalues of A we hrst recall the following controllability 
conditions. 


Lemma 3. 43?!/ The system is controllable if and 

only if the pair (A, B) is controllable and A has no eigenvalue 
A € A (A) such that A ^ 1 and A* = 1. 

Using the above, and the results of Section |IV] we may 
now give conditions for when the proposed receding horizon 
control algorithm is stabilizing. 

Theorem 2 (Practical stability Il35l ). Assume 0 < Q, 0 < R 
and that (A, B) is controllable. If we choose i G <Z N'*' 
and p = p* given by 

p* =max{i|ieI°,VAG A(A) y ^ 1 if X ^ 1} (11) 


and apply Algorithm [7] then the system state is ultimately 
bounded as per: 


a 

— < lim min 

7 fc—>oo igio 


||x(fc)|| 



< 


a 



In the above, 7 is as in m and e is the largest value in the 
interval (0,1] which Vi G fulfils 


(AW - < (1 - 


and is guaranteed to exist. 

Proof. The proof is given in Appendix O CD 

Remark 5. The bound given in Theorem^scales linearly with 
the choice of a. 

Assumption 1. Assume that ^A G A (A) except possibly A = 1 
such that |A| = 1 and the complex argument ZX = ^ • n for 
some n G N’*'. 


Lemma 4. Let Assumption Q] hold, then p* = 7 . 

Proof. The proof is given in Appendix iDl □ 

Corollary 1 (Asymptotic stability of the origin). Assume 0 < 
Q, 0 < R and that (A, B) is controllable. Further assume that 
either Assumption\J]holds or a = 0. If we choose i GlP G N’*' 
and p = p* given by ( 1771 ) and apply Algorithm |7] then 

lim x(k) = 0 

k—foo 

Proof. The proof is given in Appendix |E] □ 

From the above results we may note the following. 

Remark 6. If the assumptions of Theorem \2\ hold, then 
Corollary |7] can be used to ensure asymptotic stability of the 
origin, except in the extremely rare case that the underlying 
system (A, B) becomes uncontrollable under down sampling 


by a factor 7 , see Lemma |3] Otherwise one may use Lemma 0 
to re-design Tfi giving a new value of 7 which recovers the 
case p* = 7 . 

Remark 7. It is worth noticing that the developed framework 
is not limited to just varying the time to the next sample 
i as in Fig. \2\ One could expand this by using a “multi- 
step” approach wherein the inter sampling time is optimized 
over several frames before reverting to constant sampling with 
period p. 


VII. Extension to Multiple Loops 

Having detailed the controller for the case with a single 
loop on the network and given conditions for when it is 
stabilizing we now continue with extending to the case when 
we control multiple loops on the network, as described in 
Fig.ffl The idea is that the controller C now will run s such 
single loop controllers described in Algorithm [T] in parallel, 
one for each process Vi, 7 G L = {l,2,...,s}, controlled 
over the network. To guarantee conflict free communication on 
the network the controller C will, at the same time, centrally 
coordinate the transmissions of the different loops. 


A. Cost Function 

We start by extending the results in Section |ni| to the case 
when we have multiple loops. The cost function we propose 
the controller to minimize at every sampling instant for loop i 
is then 




2-1 


00 / Pl — l 


00 ! Pi —L , 

r-—n \ 7—n V 


r =0 \ l—O 

\\ui{k-\-i-\-r ■ pi)\\ 


derived in the same way as (0 now with G M'*", 0 < 

0 < Re and period pe G N'*' specific for the control of process 
Ve given by o. From this we can state the following. 

Definition 2. We define the notation in the multiple loop case 
following Definition [7] For a matrix Ei, e.g., Ae and Qe, we 
denote (Ef) by 7^7 • 

Theorem 3 (Closed Form Solution). Assume that 0 < Qe, 
0 < 7?^ and that the pair is controllable. Then 

mmJeixeik),i,lle{i)) = ^ + \\xeik)f(.) 


where 


p]*) = Qf + Af'^P^^^'^Af - + Nf^)Lf 

Lf = {Rf + Bf^ P^^‘^ Bfy\Af^ P^^^^ Bf + 
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and 

p(p^) _ n^p^) I j(p^) 

X pI^^'>B^ l‘^ P N'f^y. 

Denoting the vector of all ones in K" as 1„, the minimizing 
control signal sequence is given by 

Ul = {-L^^"^xe{k)lJ,-L^l‘'’xe{k+i+r-pe)lp^,...}, r e N. 

Proof. Application of Theorem[T]on the individual loops. □ 

B. Multiple Loop Self-Triggered MPC 

To formulate our multiple loop receding horizon control law 
we will re-use the results in Section |V] and apply them on a 
per loop basis. 

For each process P^ with dynamics given in ([T]) let the 
weights a I € R'*', 0 < 0 < and period pi £ N’*' 

specihc to the process be defined. If we further dehne the 
finite set 1° C N'*' for loop i we may apply Theorem [3 to 
compute the pairs {P^''\ L^i^) Vi € 1°. Provided of course 
that the pair {A^^‘\ is controllable. 

As discussed in Section |II] when choosing i the controller 
must take into consideration what transmission times other 
loops have reserved as well as overall network schedulabil- 
ity. Hence at time k = kg loop £ is restricted to choose 
i £ Ii{kg) C where X({ke) contains the feasible values of 
i which gives collisions free scheduling of the network. Note 
that at time k = ke the control input ug, and time for the next 
sample Ig only have to be computed for a single loop £. How 
Ig(ki) should be constructed when multiple loops are present 
on the network is discussed further later in this section. 

We may now continue with formulating our proposed algo¬ 
rithm for controlling multiple processes over the network. The 
following algorithm is executed whenever a sample is received 
by the controller. 

Algorithm 2. Multiple Loop Self-Triggered MPC 

1) Af time k = kg the sample xg{kg) of process Vg is 
transmitted by the sensor Sg to the controller C. 

2) The controller C constructs Xg(kg). 

3) Using xg{kg) the controller C computes 

Igikg) = arg rnin — -f \\xg{kg)\\\i^ 
ug{kg) = -L[^^^'-'^'^xg{kg). 

4) a) The controller C sends ug{kg) to the actuator Ag 

which applies ug{k) = ug{kg) to (0 until k = 
kg + Ig{kg). 

b) The Network Manager is requested to schedule the 
sensor Sg to transmit a new sample at time k = 
kg + Igikg). 

Note that at time k step 3 only needs to be performed for 
loop 1. 


Remark 8 . When the controller is initialized at time k = 0 
it is assumed that the controller has knowledge of the state 
xgiff) for all processes Vg controlled over the network. It will 
then execute Algorithm \2\ entering at step 2, in the order of 
increasing loop index 1. 

C. Schedulability 

What remains to be detailed in the multiple loop receding 
horizon control law is a mechanism for loop £ to choose Ig{kg) 
to achieve collision free scheduling. We now continue with 
giving conditions for when this holds. 

First we note that when using Theorem [3] we make the im¬ 
plicit assumption that it is possible to apply the corresponding 
optimal control signal sequence U^. For this to be possible 
we must be able to measure the state xg{k) at the future time 
instances 

Sg{kg) = {kg + Ig{kg), kg + Ig{kg) +pg, 

kg P Igikg) P “^pg, kg -f Igikg) P Upg, ...}. (12) 

Hence this sampling pattern must be reserved for use by sensor 
Sg. We state the following to give conditions for when this is 
possible. 

Lemma 5. Let loop £ choose its set Xgikg) of feasible times 
to wait until the next sample to be 

Igikg) = {i £ Ig\i k"q’" -kgPn-pq-m-pg, 

m, n S N, g G L \ {£{{ 

where k"^^' is the next transmission time of sensor Sq. Then it 
is possible to reserve the needed sampling pattern Sgikg) in 
m at time k = kg. 

Proof. The proof is given in Appendix |F] □ 

Constructing Igikg) as above we are not guaranteed that 
Igikg) ^ 0. To guarantee this we make the following assump¬ 
tion. 

Assumption 2. Assume that for every loop £ on the network 
2^ = and pg = p. Further assume that L C and 
max L < p. 

Theorem 4. Let Assumption |2] hold. If every loop £ chooses 

Igikg) = {i£ ^ -kgPr-p,r£Z,q£]L\{£}} 

all transmissions on the network will be conflict free and it 
will always be possible to reserve the needed sampling pattern 
Sgikg) in ^2}. 

Proof. The proof is given in Appendix |0 □ 

Remark 9. The result in Lemma |5] requires the reservation of 
an infinite sequence. This is no longer required in Theorem 0 
as all loops cooperate when choosing the set of feasible times 
to wait. In fact loop £ only needs to know the current time kg, 
the period p and the times when the other loops will transmit 
next V g G L \ {£} in order to find its own value Igikg). 

Remark 10. If Assumption holds and every loop on the 
network chooses Igikg) according to Theorem^ then it is 
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guaranteed that at time kf we can reserve ( 1721 ) and that no 
other loop can make conflicting reservations. Hence at time 
ki + Ii{ke) the sequence 

Si{kt + h(ki)) = {kt + hike) + p, 

ke + hike) + 2p, ke + hike) + 3p,.. 

is guaranteed to be available. Thus p S leike + hike))- 


TABLE I 

The pre-computed control laws with related cost eunctions 

AND INTERMEDIATE VARIABLES. 


i 

aO) 

B(i) 

QO) 

rO) 

TVV) 

lO) 

p(i) 


1 

1 

1 

1 

1 

0 

0.70 

1.70 

0/1 + pV) . 3,2 

2 

1 

2 

2 

3 

1 

0.46 

1.73 

a/2 + pV) . 

3 

1 

3 

3 

8 

3 

0.35 

1.89 

a/3 + P(®) • 

4 

1 

4 

4 

18 

6 

0.28 

2.08 

a/4 + P(4) . 

5=p* 

1 

5 

5 

35 

10 

0.23 

2.30 

a/5 + P(*5) . P 


D. Stability 


A. Single Loop 


We continue with giving conditions for when the multiple 
loop receding horizon control law described in Algorithmic is 
stabilizing. Extending the theory developed Section |V] to the 
multiple loop case we may state the following. 

Theorem 5. Assume 0 < Qe, 0 < Re and that iAe,Be) is 
controllable. Further let Assumption^hold. If we then choose 
i G leik) C c with Xeik) chosen as in Theorem^ 
and p = p* given by 

p* =max{7|7 gI°,V7 VA G A(A^) ^ I if \^\} (13) 


and apply Algorithm^ then as k ^ oo 


ae 


ote 


ae 


— < min —+ |k^(fc)||lw <—-(1-e^)- 

7 - ^^Idk)\ % " ^ - ee \P* 7 


where 7 = maxi*' and ee A the largest value in the interval 
(0,1] which Vi G 1° fulfills 




?(0 T (^)\^ Tjip*) { A V) 


(0 rlOt 


ft.) 


Proof. The proof is given in Appendix iHl 


□ 


Corollary 2. Assume 0 < Qe, 0 < 7?^ and that iAe,Be) is 
controllable. Further let Assumption^hold. In addition let 
be chosen so that the resulting 7 = maxi" guarantees that 
Assumption\i\holds for every loop I or alternatively let ae = 0 
for every loop L If we then choose i G Xeik) C l" C N"", 
with Xeik) chosen as in Theorem^ and p = p* given by ( 1731 ) 
and apply Algorithm]^ it holds that 

lim xeik) = 0 . 

k—fco 

Proof. The proof follows from the results in Theorem |5] 
analogous to the proof of Corollary [T] □ 


VIII. Simulation Studies 

To illustrate the proposed theory we now continue with 
giving simulations. First we show how the control law works 
when a single loop is controlled over the network and focus 
on the loop specihc mechanisms of the controller. Secondly 
we illustrate how the controller works when multiple loops 
are present on the network and focus on how the controller 
allocates network access to different loops. 


Let us exemplify and discuss how the controller handles the 
control performance versus communication rate trade-off in an 
individual loop. We do this by studying the case with a single 
system on the network. The system we study is the single inte¬ 
grator system which we discretize using sample and hold with 
sampling time Is = 1 s giving us a;(fc -f 1) = Axik) + Bufk) 
with iA,B) = (1,1). Since we want the resulting self- 
triggered MFC described in Algorithm [T] to be stabilizing 
we need to make sure that our design fulhls the conditions 
of Theorem |2] If we further want it to be asymptotically 
stabilizing we in addition need it to fulhl the conditions of 
Corollary [T] 

The design procedure is then as follows: First we note that 
the system (A, B) is controllable. The next step is to decide the 
weights 0 < Q and 0 < 7? in the quadratic cost function Q. 
This is done in the same way as in classical linear quadratic 
control, see e.g. ll^ . Here we for simplicity choose Q = 1 
and 7? = 1. We note that the system only has the eigenvalue 
A = 1, fulhlling Assumption [T] so that (fTTT i in Theorem | 2 ] 
gives p* = maxi". Hence Corollary [T] and thus Theorem |2] 
will hold for every choice of l". This means that we may 
choose the elements in I", i.e., the possible down sampling 
rates, freely. A natural way to choose them is to decide on a 
maximum allowed down sampling rate and then choose l" to 
contain all rates from 1 up to this number. Let’s say that we 
here want the system to be sampled at least every 5 • Is s, then 
a good choice is l" = {1,2,3,4, 5}, giving p* = maxi" = 5. 

Now having guaranteed that the conditions of Theorem |2] 
and Corollary [T] hold we have also guaranteed that the 
conditions of Theorem [1] are fulfilled. Hence we may use 
it to compute the state feedback gains and cost function 
matrices that are used in Algorithm [T] The results from these 
computations are shown in Table |I] together with some of the 
intermediate variables from Dehnition [T] Here we see that the 
cost functions are quadratic functions in the state x where the 
coefficients are functions of Q and R. We also see that 
the cost to sample a/i enters linearly and as we change it we 
will change the offset level of the curves and thereby their 
values related to each other. However it will not affect the 
state feedback gains. 

A graphical illustration of the cost functions in Table |I] for 
the choice a = 0.2, is shown in Fig. |3] together with the 
curve 7(fc) = argminjo (a;(/c)), i.e., the index of the cost 
function which has the lowest value for a given state x(fc). 
This is the partitioning of the state space that the self-triggered 
MFC controller will use to choose which of the state feedback 







Fig. 3. The cost functions (dashed) together with the partitioning 

of the state space and the time to wait I(k) = arg min 2 ;o (^^(fc)) (solid). 


gains to apply and how long to wait before sampling again. 

Applying our self-triggered MPC described in Algorithm [T] 
using the results in Table |T] to our integrator system when 
initialized in a;(0) = 2 we get the response shown in Fig. |4(a)| 
Note here that the system will converge to the fixed sampling 
rate p* as the state converges. 

It may now appear as it is sufficient to use periodic control 
and sample the system every p* ■ TgS to get good control 
performance. To compare the performance of this periodic 
sampling strategy with the self-triggered strategy above we 
apply the control which minimizes the same cost function Q 
as above with the exception that the system now may only be 
sampled every p* ■ Tg s. This is in fact the same as using the 
receding horizon control above while restricting the controller 
to choose i = p* every time. The resulting simulations are 
shown in Fig. |4(b)| As seen, there is a large degradation of 
the performance in the transient while the stationary behaviour 
is almost the same. By this we can conclude that it is not 
sufficient to sample the system every p* ■ TgS if we want 
to achieve the same transient performance as with the self- 
triggered sampling. 

In the initial transient response the self-triggered MPC 
controller sampled after one time instant. This indicates that 
there is performance to gain by sampling every time instant. 
To investigate this we apply the control which minimizes 
the same cost function (Ell, now with the exception that the 
system may be sampled every Tg s, i.e., classical unconstrained 
linear quadratic control. Now simulating the system we get 
the response shown in Fig. |4(c)| As expected we get slightly 
better transient performance in this case compared to our 
self-triggered sampling scheme, it is however comparable. 
Note however that this improvement comes at the cost of 
a drastically increased communication need, which may not 
be suitable for systems where multiple loops share the same 
wireless medium. 

From the above we may conclude that our self-triggered 
MPC combines the low communication rate in stationarity of 
the slow periodic controller with the quick transient response 
of the fast periodic sampling. In fact we may, using our 
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x{k) 



(a) System response of the integrator system when minimizing 
the cost by using our single loop self-triggered MPC. 

x{k) 



(b) System response of the integrator system when minimizing 
the cost by sampling every 5^^ second. 

x{k) 



(c) System response of the integrator system when minimizing 
the cost by sampling every second. 


Fig. 4. Comparing control performance for different sampling policies. 
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Average sampling interval 



a 


Fig. 5. The average sampling interval for different values of a for the 
integrator system. 

method, recover the transient behaviour of fast periodic sam¬ 
pling at the communication cost of one extra sample compared 
to slow periodic sampling. The reason for this is that the 
fast sampling rate only is needed in the transient while we 
in stationarity can obtain sufficient performance with a lower 
rate. 

B. Comparison to periodic control 

To make a more extensive comparison between the self- 
triggered algorithm and periodic sampling, we sweep the value 
of the design parameter a in the cost function, and run 
100 simulations of 10000 samples for each value. To add 
randomness to the process, the model ([TJ is extended to include 
disturbances such that for process £ 

xi{k + 1) = Aexi{k) + Biue{k) + Eeuje{k), (14) 

where uje{k) ~ A/"(0, a'j) is zero-mean normal distributed with 
variance aj = 0.1 and Eg — 1. We increase the maximum 
allowed sampling interval to p* = maxl^ = 15. This though 
has the effect, that pA) = 1.83 whereas P^^'> = 1.74, such 
that it is always more favourable to have a sample interval 
of 2 samples instead of 1, even when a = 0. We therefore 
changed the cost of control to i? = 0.1 such that we allow for 
larger control signals and force the process to sample at every 
time instance if a = 0 is chosen. The initial state is randomly 
generated as xi{0) ^ A/’(0, , where ^ = 2 500. 

Fig. |5] shows the average sampling interval during the 
simulations for different choices of a for the integrator system. 
The relation between the value of a and the average sampling 
interval is in general monotonic but highly affected by the 
system to be controlled and the statistics of the disturbances 
in the system. 

Since we now have the range of values in which we want 
to sweep a, we present a simple periodic MFC cost to which 
we compare our algorithm. This cost is given by 

OO Ta 

Jt{xi{k),Ug) = EE {\\xt{k + I + rTs)\\Q^ 

r—0 l—O 

+ \\ue{k + I + r'Ts)||^J , 

which using Dehnition[T]and Lemmas [T] and | 2 ] can be rewritten 


Average empiric cost 



Fig. 6. Performance of the integrator using the self-triggered algorithm 
compared to the simple periodic algorithm for different sampling intervals. 
Some values of a are marked with the black crosses. 

as 

OO 

Ji{xt{k),Ue) = \\xi{k+rTs)\\Ys) + \\Mk+rTs)\\Ys) 

r=0 ^ '' 

P2x(,{kPrT,fNf\g(kPrT,) (15) 

for sampling period Ts where 

xg{k P Ts) = A^^‘\g{k) -I- BY^ug{k). 

When Ts = 1, ( fTSl l reduces to classical unconstrained linear 
quadratic control. 

The empiric cost for each simulation is calculated by 
1 

— Y xe{k)'^QgXi{k) + ui{k)'^Rtui{k), (16) 

fc—0 

where T = 10 000 is the length of the simulation. 

Fig. |6] shows the average performance that is obtained for 
different averaged sampling intervals obtained by sweeping a 
from 0 to 10®. Fig.|6]illustrates that the self-triggered algorithm 
performs significantly better than periodic sampling especially 
when communication costs are medium to high. However, at 
sampling intervals of 2 and 3 time-steps the self-triggered 
algorithm performs slightly worse than the periodic algorithm. 
This is caused by the fact, that if the state of the system is close 
to zero when sampled, the cost of sampling is much higher 
than the cost of the state error and control, hence the time until 
the next sample is taken is large. To avoid this phenomenon, in 
future work, one could take the statistics of the process noise 
into account in the cost function. It can further be noticed, that 
when a ^ OO, the performance of the self-triggered algorithm 
is very close to the cost of the periodic algorithm. This is as 
expected, since the cost of a will be greater than the cost 
of the state and control, which will result in sample intervals 
of i = p* = 15. This reduces the self-triggered algorithm to 
periodic control. 

C. Multiple Loops 

We now continue with performing a simulation study where 
we control two systems over the same network. We will 
start by showing a simple example followed by a more 
extensive performance comparison of the Algorithm |2] to 
periodic control. We will keep the integrator system from 
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Section IVIII-AI now denoting it process Vi with dynamics 
xi{k + 1) = Aixi{k) + Biui{k) with = (1,1) as 

before. In addition we will the control process V 2 which is 
a double integrator system which we discretize using sample 
and hold with sampling time Tg = Is giving 


X 2 

xl{k ■ 


(fc- 


1 ) 

1 ) 


(fc) 


x\ 

xl{k) 


+ 


X'zik+l) 


X2{k) 


1 

0.5 

B 2 


U2{k). 


We wish to control these processes using our proposed 
multiple loop self-triggered MPC described in Algorithm |2] 
As we wish to stabilize these systems we start by checking 
the conditions of Theorem |5] and Corollary |2] First we may 
easily verify that both the pairs {Ai,Bi) and (^ 2 ,^ 2 ) are 
controllable. To use the stability results we need Assumption|2] 
to hold, implying that we must choose pi — P 2 = P, 
if =X 2 = I^ and choose I^ such that { 1 , 2 } G 1 ° and 2 < p. 
For reasons of performance we wish to guarantee that the 
systems are sampled at least every 5-TsS and therefore choose 
if = if — I^ = { 1 , 2 ,3,4,5} fulfilling the requirement 
above. We also note that A(Ai) = {1} and A(A 2 ) = {1,1} 
and that hence both system fulfill Assumption[T]for this choice 
of 1°, implying that (fOT l in Theorem |5] gives p* = maxl° = 
5. Thus choosing p = p* as stated in Theorem |5] results in that 
Assumption I 2 ] holds. What now remains to be decided are the 
weights ai, Qf, and Ri. 

For the integrator process Vi we keep the same tuning as 
in Section IVIII-AI with Qi = i?i = 1. Having decided Qi, 
Ri, 1° and p* we use Theorem [3 to compute the needed state 
feedback gains and cost function matrices {P ^'^, Vi G 1 ° 

needed by Algorithmic] We also keep ai = 0.2 as it gave a 
good communication versus performance trade-off. 

For the double integrator process 1^2 the weights are chosen 
to ht Q 2 = I as we consider both states equally important and 
i ?2 = to favor control performance and allow for larger 
control signals. Having decided Q 2 , R 2 , and p* we may 
use Theorem [3] to compute the needed state feedback gains 
and cost function matrices (^ 2 *^X 3 *^) Vi G needed by 
Algorithm |2] The sampling cost is chosen to be 02 = 1, as 
this gives a good tradeoff between control performance and 
the number of samples. 

We have now fulfilled all the assumptions of both Theo¬ 
rem |5] and Corollary Id Hence applying Algorithmic] choosing 
Ii{k() according to Theorem |4] will asymptotically stabilize 
both process Vi and 7 ^ 2 - 

Controlling Vi and V 2 using our multiple loop self-triggered 
MPC described in Algorithmic] with the above designed tuning 
we get the result shown in Fig. |7] As expected, the behavior of 
the controller illustrated in Section lVIITAI carries through also 
to the case when we have multiple loops on the network. In 
fact comparing Fig. |4(a)| showing how the controller handles 
process 7^i when controlling it by itself on the network and 
Fig. |7(a)| which shows how Vi is handled in the multiple loop 
case we see that they are the same. Further we see that, as 
expected, in stationarity the two loops controlling process Vi 
and V 2 both converge to the sampling rate p*. 

As mentioned previously the controller uses the mechanism 
in Theorem |4] to choose the set of feasible times to wait 


xi{k) 
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(a) System response for process Vi 
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(b) System response for process 7^2 


Fig. 7. The processes and 1^2 controlled and scheduled on the same 
network using our multiple loop self-triggered MPC. 


until the next sample. In Fig. |8] we can see how the resulting 
sets Ii{k() look in detail. At time fc = 0 loop 1 gets to 
run Algorithm |C] first. As sensor S 2 is not scheduled for 
any transmissions yet Ii (0) = iP from which the controller 
chooses Ii (0) = 1. Then loop 2 gets to run Algorithmic at time 
A: = 0. As sensor 5i now is scheduled for transmission at time 
A: = 0 -I- /i(0) = 1, Theorem |4] gives X2(0) = lP \ {1}. From 
which the controller chooses /2(0) = 2. The process is then 
repeated every time a sample is transmitted to the controller, 
giving the result in Fig. 0 As seen both the set Ii{ ) and the 
optimal time to wait Ii{-) converges to some fixed value as 
the state of the corresponding process Vi converges to zero. 

D. Comparison to periodic control 

For a more thorough performance comparison, we simulate 
the systems using Algorithm |2] and compare them to the peri¬ 
odic algorithm that uses the cost function (fTSi) . The single and 
double integrator processes {Vi and V 2 ) are simulated using 
the parameters mentioned in Section IVIIII The variance of the 
disturbances for both processes are set to cr| = 0.1, V£, the 
variance of the initial state to ^ = 25, V£ and E 2 = [1, 1]^ 
in (fT4l) . The value of a is identical for both processes in 
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Normalized average empiric cost 
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Fig. 10. Performance for both processes for the self-triggered algorithm 
compared to a simple periodic algorithm for different sampling intervals. The 
max sampling interval p* = 15. This is done by varying a. from 0 to 10^ 
and calculating the average amount of samples for every value of a. 


Fig. 8. The sets X^(-) of feasible times to wait until the next sample for 
loop 1 and loop 2. The optimal time to wait /^(■) is marked by red circles 
whereas the other feasible times are starred. 
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Fig. 9. Performance for both processes for the self-triggered algorithm 
compared to a simple periodic algorithm for different sampling intervals. The 
max sampling interval p* = 5. This is done by varying a. from 0 to 5 • 10“^ 
and calculating the average amount of samples for every value of a. 


each simulation, such that ai = a 2 - Further Ri = 0.1. The 
simulation for both algorithms is initialized as described in 
Remark [8] 

Fig |9] shows the performance for the processes, Vi and 
V 2 , calculated by averaging (O over 100 simulations each 
of length 10000 for different values of a when p\= P 2 = 5. 
Fig. [TOl shows the performance when p\= P 2 = 15. The cost 
of each process is normalized with respect to its largest cost 
for easier viewing. 

Both figures show that the self-triggered algorithm in gen¬ 
eral outperforms periodic sampling. The performance margin 
increases as the sampling interval increases. Fig. |9] shows that 
Vi when a = 0 samples every 1.8 time steps on average, 
whereas V 2 only samples every 2.2 time steps. When a 
increases both processes sample almost every 2 time steps. 
The reason for this is that the cost of the state for the down 
sampled systems in some cases is lower. 

The worse performance of the self-triggered algorithm at 
lower sampling intervals is more significant in Fig. [TO] where 
the performance of process Vi shows a similar performance 
as in Fig. |6] The lowest average sampling interval for process 
V 2 is 3.6 time steps when a = 0. The self-triggered algorithm 
though significantly outperforms the periodic algorithm when 


the average sampling interval increases. 

As a CX3 the cost of sampling forces the self-triggered 
algorithm to behave similar to the periodic sampled algorithm. 
Therefore the performance gap between the periodic and self- 
triggered algorithms narrows, as the average sampling interval 
is close to p*. 


IX. Conclusions 

We have studied joint design of control and adaptive 
scheduling of multiple loops, and have presented a method 
which at every sampling instant computes the optimal control 
signal to be applied as well as the optimal time to wait 
before taking the next sample. It is shown that this control 
law may be realized using MFC and computed explicitly. 
The controller is also shown to be stabilizing under mild 
assumptions. Simulation results show that the use of the 
presented control law in most cases may help reducing the 
required amount of communication without almost any loss 
of performance compared to fast periodic sampling. 

In the multiple loop case we have also presented an algo¬ 
rithm for guaranteeing conflict free transmissions. It is shown 
that under mild assumptions there always exists a feasible 
schedule for the network. The complexity of the multiple 
loop self-triggered MFC and the corresponding scheduling 
algorithm scales linearly in the number of loops. 

An interesting topic for future research is to further inves¬ 
tigate the complexity and possible performance increase for 
such an extended formulation. 

Intuitively, additional performance gains can be achieved 
when the cost function takes process noise into account as 
well. Exploiting this could be of interest in future research. 

Another topic for future research would be to apply the 
presented framework to constrained control problems. 
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Appendix 


A. Proof of Lemma \2\ 

Proof Following Lemma [T] the problem is equivalent to 

OO 

min 7 

U{i) ^ 

^ ' r=0 

+u{k + i + r ■ pf"+ i + r ■ p) 

+2x{k + i + r ■ p)'^N^P^u{k + i + r ■ p) 

with 

x{k + i + {r+l)-p) = A^P^x{k + i + r-p) + B^P'^u{k + i + r-p). 

This problem has the known optimal solution, see e.g. EH, 
\\x{k + i)llp(p)- Where is given by the Riccati equation 
(|6|l, which has a solution provided that 0 < R^p\ implied 
by 0 < i?, and 0 < Q^p\ implied by 0 < Q, and that the 
pair is controllable. Exactly what is stated in the 

Lemma. □ 


^x{k + i + r ■ p)'^Q^P^x{k + i + r ■ p) 


B. Proof of Theorem Q] 

Proof From the Theorem we have that 0 < Q, 0 < i? and 
that the pair {A^p\B^p^) is controllable. Thus we may use 
Lemma |2] to express the cost (|4| as 

J{x{k),i,U{i)) = - + ||a;(fc + i)||Lp) 

% 

+ ^ f ll^(^ + OIIq + • 
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Now applying Lemma [T] we get 

J{x{k),i,U{i)) = - + ||a;(fc + z)|||(p) 

% 

+ x{kfQ^^x{k) + u{k)'^R^^u{k) + 2x{kf u{k). 

with x{k + i) = A^'^^x{k) + B^^^u{k). Minimizing 

J{x{k),i,k({i)) now becomes a finite horizon optimal control 
problem with one prediction step into the future. This problem 
has the well defined solution 0 , see e.g., Gzl, given by 
iterating the Riccati equation ®. □ 


C. Proof of Theorem |2] 

Proof By assumption {A, B) is controllable. Together with 
the choice ofp* this, viaLemma [3 implies that {A^p‘\ B^p"')) 
is controllable. Further let x{k'\k) denote an estimate of x(fc'), 
given all available measurements up until time k. Defining 

ll^(fc|A)lll« " E + ^i^)IIq + ii*(fc|fc)ii«) (17) 

1=0 ^ ' 

we may, since by assumption 0 < Q and 0 < i?, use Lemma|2] 
and Theorem [T] to express 14, the optimal value of the cost 
@ at the current sampling instant k, as 

14— min J(x(k),i,u{k\k)) 

,u{k\k) 

= min - + \\x{k + i|fc)||p(p.) + ||x(fc|fc)|||(,) (18) 

zGxU I 

= min- + p(fc|fc)||^„. 

i^X^ % 


Since e G (0,1] this is equivalent to 

which as I ^ oo gives us an upper bound on the cost function, 
Vk+i' < /3/e- Applying this idea on our setup we should fulfil 

P I 

- ~ *l*)llp(p*) - 4Hk\k)\\l(i) + P. 

Choosing /3 = a/p* — (1 — e)a/^ we have fulfilment if 

e(||i(fc + /|/c)||^(p.) + p(/c|fc)|||(,)) < ||x(fc|fc)|||„. (20) 

Clearly there 3e G (0,1] such that the above relation is fulfilled 
if 0 < ||i;(fc|fc)|||(i). For the case ||a/(fc|fc)|||(i) = 0 we must, 
following the definition (fTTl i and the assumption {) < Q, have 
that x{k\k) = 0 and x{k + i\k) = 0 and hence the relation 
is fulfilled also in this case. Using the final step in (fTSl l we 
may express (l 20 l l in easily computable quantities giving the 
condition 


||i(fc + i|fc)||^(p.) < (1 - e)p(fc|fc)||^(,). 

As this should hold Vx and Vz G TP we must fulfill 

(^(0 _ PPP [A^^ - B^L^^) < (1 - e)pW 

which is stated in the Theorem. Summing up we have 


V) 


k+V 




We will use Vk as a Lyapunov-like function. Assume that Vk+i 
is the optimal cost at the next sampling instant k + i. Again 
using Theorem [U we may express it as 

14 +i = min Jixik + i), j,u{k + i\k + i)) 

j^X^ ,u{k-\-i\k-\-i) 

< min Jixik + i),j = p* ,u(k + i\k + if) 

~ u{k+i\k+i) 

® ii-/, .1, .mi2 (^^^ 

-f z A; -f z) p(p.) 

p* 

= ^ + P(fc + z|fc)||^(p.,. 

p* 

Where the inequality comes from the fact that choosing j = p* 
is sub-optimal. Taking the difference we get 

Vk+^-Vk<^---\\mk)\\li0 

p* I ° 

which in general is not decreasing. However we may use the 
following idea to bound this difference: Assume that there 
3 e G (0,1] and /3 G M"*" such that we may write 

Vk+i — Vk P —PPk + P, 

for all fc, fc -I- z G V, see (fTSl l and (doj. Thus, at I sampling 
instances into the future, which happens at let’s say time k+l', 
we have that 

l—i 

Vk+v < (l-e)'-Ufe+^-E(l-e)'- 

r =0 


which is minimized by maximizing e. From the definition of 
the cost 0 we may also conclude that a/^ < 14+;/. With 

Vk+i' = min (- + \\x{k + l'\k + /')llp(i)l 

iGl” \ Z / 

we may conclude that 

— < lim min (^ + l|a/(A:|fc)||p(i)^ < — f ^ - (1 - e)-^ • 
7 k=too ieio ft ^ J e \p 7 J 

□ 


D. Proof of Lemma |4] 

Proof From it is clear that p* = 7 if ^ A G A (A) except 
A = 1 such that A^ = 1. In polar coordinates we have that 
A = |A| exp(j • ZA) implying A'’' = lAp exp(j • 7 • ZA) = 1 
may only be fulfilled if |A| = 1 and ZA = ^ • n for some 
n G N’*', which contradicts Assumption [T] □ 


E. Proof of Corollary Q] 

Proof From Theorem |2] we have that as k ^ 00 


a 

— < mm 
7 ieio 


(f+ ll^(fc)ll 



< 


a 



which as p* = 7 , given by Lemma @1 simplifies to 


a 

— < min 
7 ieio 


(Y + ll^(fc)llp«) 
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independent of e. Implying that as k ^ oo we have that 1 = 7 
and ||a;(fc)||p(i) = 0, since 0 < provided 0 < Q this 
implies x{k) = 0 independent of a. In the case a = 0 the 
bound from Theorem |2] simplifies to that as k ^ 00 

0 <mm||x(fc)||^„ < 0 

and hence for the optimal i we have ||x(fc)||p(i) = 0 implying 
x{k) = 0 as above. □ 

F. Proof of Lemma |5] 

Proof When loop q was last sampled at time kq < kg it 
was optimized over Xqfkq) and found the optimal feasible 
time until the next sample Iqikq). The loop then reserved the 
infinite sequence 

^qifq) ~ Iq{kqf kq -f Iq{k'^ -f Pg, 

kq “t“ fg(Alg) “t“ ‘2,pq^ kq “t“ Iqfkq^ “t“ ‘^Pqi • ■ 

When loop £ now should choose Ii{ki) it must be able to 
reserve (fT2li . To ensure that this is true it must choose Ie{ki) 
so that Si{ki) n Sq{kq) = 0, Vg G L \ {£}. This holds if we 
have that 

ke + Ie{ke) +m-pi kq + Iqikq) + n • pg, 

VTO,nGN, VgGL \ {£} 

Simplifying the above condition we get conditions on the 
feasible values of hikf) 

h{kt) f [kq + Iqfkq)'^ - ke + U ■ Pq - TTl ■ Pf, 

Vm,nGN, VqGL\{£} 

Noticing that [kq + Iq{kq)^ is the next transmission of loop q 
we denote it giving the statement in the lemma. □ 

G. Proof of Theorem |4] 

Proof Assuming pe = p and 2^ = 1° for all loops Lemma |5] 
gives leife) as stated in the theorem. Since by assumption 
L C 2*^ we know that 

Ie{ke) A {i G L|i ^ + r • p, r G Z, q G L \ {£}}. 

Further, since maxL < p we know that the set 
{i G L|i = fc™’" — + r • p, r G Z} 

contains at most one element for a given loop q. Thus, 

{i G Ml f -A:^ + r-p, rGZ, gGL \ {£}} 

contains at least one element as g G L \ {£} C L. Hence 
Ii{ke) always contains at least one element \/ke, and thus 
there always exists a feasible time to wait. □ 

H. Proof of Theorem |5] 

Proof Direct application of Theorem |2] on each loop. The 
corresponding proof carries through as the choice of leik) 
together with the remaining assumptions guarantees that we 
may apply Theorem[3]on the feasible values of i in every step, 
thus the expression for I 4 in (fTSl l always exists. The critical 
step is that the upper bound on Vk+i in (fT^ must exist, i.e., 
the choice j = p* must be feasible. This is also guaranteed 
by the choice of Xe{k), see Remark fTOl □ 


